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Protein aggregation, a well-known culprit in human discasc,''- 
is also a major problem facing the use of proteins as therapeutic or 
diagnostic agents.'-'* Insights into the protein aggregation problem 
have been garnered from the study of natural proteins, where a 
relationship between solubility and net charge has been noted. For 
example, it is known that proteins arc least soluble at their isoelectric 
point, where they bear a net charge of zcro.^ More recently, small 
differences in net charge (±3 charge units) have been shown to 
predict aggregation tendencies among peptide variants.^ In addition, 
intrinsically disordered proteins,'-^ a class of proteins that are largely 
unfolded in the cell but that do not lead to pathological aggregation, 
tend to have large net charges.^- 

We speculated that the relationship between net charge and 
aggregation resistance might also be applicable to globular proteins, 
which can aggregate via partial unfolding induced by thermal 
agitation, chemical treatment, or conformational breathing. Receipt 
evidence that some proteins can tolerate significant changes in net 
charge (for example, the finding thai carbonic anhydrasc retains 
activity after exhaustive acelylation of its surface lysines") encour- 
aged us to test the hypothesis that the solubility and aggregation 
resistance of some proteins might be significantly enhanced, without 
abolishing their folding or function, by extensively mutating their 
surfaces to dramatically increase their net charge, a process we refer 
to as '"supercharging". 

We began with green nuorcscenl protein (GFP), an easily assayed 
protein that undergoes ehromophore maturation and becomes 
fluorescent only when folded correctly. To minimize the possibility 
that our GFP was unusually delicate and therefore unusually easy 
to improve, we used a starling GFP (slGFP) based on the stalc- 
of-lhc-arl GFP variant called ''supcrfoldcr'\ which has been highly 
optimized for folding robustness and resistance to denaturants.'- 
The net charge of the slGFP is -7, similar to that of wild-type 
GFP (—8). To create a superposilivc variant of GFP, wc idcntiHed 
29 positions in the crystal structure thai were highly solvent-exposed 
and mutated these lo positively charged amino acids (Lys and 
Arg), yielding a design with a iheoretical net charge of -|-36 (Figure 
1 and Supporting Information). Genes encoding slGFP and GFP- 
(+36) were constructed and expressed in coli, and both genes 
yielded intensely green fluorescent bacteria. Following protein 
purification, the fluorescence properties of GFP(+36) were mea- 
sured and found to be very similar to those of stGFP (Supporting 
Information). 

Encouraged by this finding, wc produced and characterized 
additional supercharged GFPs having net charges of +48, -25, 
and —30, all of which were also found to have stGFP-like 
fluorescence (Figures 2a and SI). All supercharged GFP variants 
exhibited circular dichroism spectra similar to that of stGFP, 
indicating that the proteins have similar secondary structure content 
(Figure 2b). The thermodynamic stabilities of the supercharged GFP 
variants were only modestly lower than that of stGFP (1.0—4.1 
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Figure 1. Supercharged green nuorescenl proicins (GFPs). (a) Protein 
sequences of GFP variants, with tluorophore-forming residues highlighted 
green, negatively charged residues highlighted reel, and positively charged 
residues highlighted blue, (b) Flectroslaiic surlace poientials of ihe starting 
GFP (StGFP), GFP(+36). and GFP(-30). colored from -25 kT/e (red) to 
+25 kT/e (blue). 



kcal/mol, Figure 2c and Table SI) despite the presence of as many 
as 36 mutations. 

We next examined the effect of supercharging on aggregation 
resistance. Although stGFP is the product of a long history of GFP 
optimization,'-' like most proteins it can be induced to aggregate 
by thermal or chemical unfolding. Heating stGFP to 100 °C induced 
its quantitative precipitation and the irreversible loss of fluorescence 
(Figures 3a and S2). In contrast, supercharged GFP(+36) and GFP- 
(—30) remained soluble when heated to 100 °C and recovered 
significant fluorescence (62 and 28% of their initial fluorescence, 
respectively) upon cooling (Figure 3a). When mixed with 40% TFE, 
an additive commonly used to examine chemically induced protein 
aggregation, stGFP lost all lluoresccncc and began to visibly 
aggregate (Figures 3b and S3). Within half an hour at 25 °C, 
aggregation of stGFP was complete as followed by right-angle light 
scattering. The +36 and -30 supercharged GFP variants also 
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Figure 2. Intramolecular properties of GFP variants, (a) Staining and UV fluorescence of purified GFP variants. Each lane and tube contains 
0.2 /fg of protein, (b) Circular dichroism spectra of GFP variants, (c) Thermodynamic stability of GFP variants, measured by guanidinium-induced 
unfolding. 



became completely non fluorescent when exposed to 40% TFE; 
however, in contrast to stGFP, they suffered no significant 
aggregation under the same conditions even after several hours 
(Figure 3b). Taken together, the results of these studies indicate 
that supercharged GFPs, upon thermal or chemical denaturation, 
remain entirely soluble. 

In addition to this aggregation resistance, supercharged GFP 
variants show a strong, reversible avidity for highly charged 
macromoleeules of the opposite charge (Figures 3c and S4). When 
mixed together in 1:1 stoichiometry, GFP(+36) and GFP(-30) 
immediately formed a green fluorescent co-precipitate, indicating 
the association of folded, functional proteins. GFP(+36) similarly 
co-precipitated with high concentrations of RNA or DNA. The 
addition of NaCl was sufficient to dissolve these complexes, 
consistent with the electrostatic basis of their formation. In contrast, 
StGFP was unaffected by the addition of GFP(-30), RNA, or DNA 
(Figure 3c). 

Because the robust folding of supcrfoldcr GFP together with its 
monomeric nature may have contributed to the ability of stGFP to 
be supercharged, we next sought to detei-mine whether the 
supercharging principle could apply to proteins other than GFP, 
including non-monomeric proteins with binding or catalytic activi- 
ties. To this end, we applied the supercharging process to two 
proteins unrelated to GFP. 

Strcptavidin is a tctramcr with a total net charge of -4. Using 
an entirely automated version of our mutagenesis strategy (Sup- 
porting Information), we designed two supercharged strcptavidin 
variants with theoretical net charges of —40 or +52 and expressed 
and purified the proteins (Figure S5). Both were capable of forming 
ictramcrs, as judged by analyiica! size-exclusion chromatography 
(Figure SIO), and both had signiHcanl, albeit greatly reduced, biotin- 
binding capacity (Figure SI 1). Like the supercharged GFPs, the 
supercharged streplavidins were resistant to thermally induced 
aggregation and undei-went no significant aggregation after boiling; 
in contrast, wild-type strcptavidin was 93% aggregated after heating 
to 100 °C for 1 min (Figure S7 and Table SI). 

Glutathionc-S-transferasc (GST), a dimcr with a total net charge 
of +2, was supercharged to yield a dimer with net charge of —40 
(Figure S5). Purified GST(~40) was able to form dimers (Figure 

58) and exhibited comparable Ku and only 3-fold lower Avai than 
that of the wild-type enzyme (wtGST: = 0.3 db 0. 1 mM, kc,u 
= 46 s"'; GST(-40): = 0.25 ± 0.05 mM, A',,, = 16 s-') (Figure 

59) . Moreover, supercharged GST remained soluble when heated 
to 100 °C, in contrast to its wiid-typc counterpart which, like stGFP 
and wild-type strcptavidin, precipitated quantitatively and irrevers- 
ibly (Figure S7). In addition, GST{-40) recovered 40% of its 
catalytic activity upon cooling (Figure 3d), unlike wtGST, which 
was irreversibly aggregated and had no delectable residual activity 
after cooling. 



In summary, we have demonstrated that monomeric and multi- 

mcric proteins of varying structures and functions can be "super- 
charged" by simply replacing their most solvent-exposed residues 
with like-charged amino acids. Supercharging greatly alters the 
intermolecular properties of proteins, imparting aggregation resis- 
tance and the ability to associate in folded form with oppositely 
charged macromoleeules like "molecular Velcro" (Figure 3c). We 
note that these unusual inteiTnoleeuIar properties arise from high 
net charge, rather than from the total number of charged amino 
acids, which was not significantly changed by the supercharging 
process (Table SI). 

In contrast to the substantial intermolecular effects we observed, 
the intramolecular properties of the seven supercharged proteins 
studied here were largely intact: specifically, the supercharged 
GFPs retained the ability to assume a fiuorcsccnt native-like state; 
the supercharged GST remained dimeric and retained catalytic 
activity; and the supercharged streptavidins retained some biotin 
affinity and their propensity to form tetramers. Supercharging 
therefore may represent a useful approach for reducing the 
aggregation tendency and improving the solubility of some proteins 
without abolishing their function. 

The high charge density on the surface of the supercharged 
proteins is likely to perturb the pK^ values of some of the charged 
residues, potentially lowering the actual magnitude of net charge. 
We found some evidence for this effect: for example, GST(-40), 
with a predicted pi of 4.8, migrated as a protein with a pi of 5.5 on 
an isoelectric focusing (lEF) gel (Figure S6); however, GST(+36), 
with a predicted pi of 10.4, indeed failed to migrate into an lEF 
gel wilh a nominal pi range of 3 - 10. Further, not all supercharged 
proteins are readily accessible. An early design for a supcrncgative 
GFP did not yield fiuorcscent protein, and a gene encoding a 
supcrpositivc GST (Supporting Information) failed to express in 
E. coli. 

Protein supercharging illustrates the plasticity of protein 
surfaces and highlights the opportunities that arise from the 
mutational tolerance of solvent-exposed residues. For example, it 
was recently shown that the thermodynamic stability of some 
proteins can be enhanced by rationally engineering charge-charge 
interactions,''' and that introducing a positively charged patch of 
amino acids onto the side of GFP endowed the protein with cell 
permeability.'-*' Protein supercharging demonstrates how this plastic- 
ity can be exploited in a different way to impart robust resistance 
to protein aggregation. Our findings arc consistent with the results 
of a eomplementai-y study in which removal of all charges from 
ubiquitin did not prevent folding but significantly impaired its 
solubility.'^' 

The principles revealed here may be particularly useful in de 
novo protein design elTorts, where unpredictable protein handling 
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properties including aggregation remain a significant challenge.'^ 
In light of the above results of supercharging natural proteins, it is 
tempting to speculate that the aggregation resistance of designed 
proteins could also be improved by biasing the design process to 
increase the frequency of Hkc-chargcd amino acids at positions 
predicted to lie on the outside of the folded protein. 

These observations may also illuminate the modest net-charge 
distribution of natural proteins.' ^ '^ The net charge of 84% of 
Protein Data Bank (PDB) polypeptides, for example, falls within 
±10. Our results argue against the hypothesis that high net charge 
creates sufficient electrostatic repulsion to force unfolding. 
Indeed, GFP(+48) has a higher positive net charge than any 
polypeptide cun-cntiy in the PDB, yet retains the ability to fold 
and fluoresce. Supercharged proteins have a charge distribution 
reminiscent of micelles, which arc stabilized by colloidal forces 
that arc able to overcome charge— charge repulsion.-^ Instead, our 
findings suggest that nonspecific intcrmolccular adhesions may 
have disfavored the evolution of too many highly charged 
natural proteins. Consistent with this hypothesis, almost all 
natural proteins with vcvy high net charge, such as ribosomal 
proteins L3 (+36) and LI 5 (+44), which bind RNA, or cal- 
scqucstrin (-80), which binds calcium cations, associate with 
oppositely charged species as part of their essential cellular 
functions. 
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Figure 3. (n-c) liiicrmoleciilar properties ofsiiperchargcd GFPs. (a) UV- 
illiiminaied samples of pitrified GFP variants ("native"), those samples 
heiUed I iiiin <)i 100 T ("boiled"), and iliose samples subsequently cooled 
for 2 h at 25 °C ("cooled"), (h) Aggregation of GFP variants was induced 
with 40% TFE at 25 °C and monitored by right-angle light scattering, (c) 
Supercharged GFPs adhere reversibly to oppositely charged macromolecules, 
Sample 1: 6 iig orGFP(+36) in 30 /fL of 25 mM Tris pH 7.0 and 100 
mM NaCI. Sample 2: 6 /(g of OF P(- 30) added to sample 1. Sample 3: 30 
«g of salmon sperm DNA added to sample 1. Sample 4: 20 /tg of/:. coH 
tRNA added to sample I. Sample 5: addition oTNaCl lo 1 M of sample 4. 
Samples 6—8: identical to samples I, 2, and 4, respectively, except using 
StGFP instead of GFP(+36). All samples were spun briefly in a microcen- 
trifuge and visualized under UV light, (d) Enzymatic activity of GST 
variants. Reactions contained 0.1 mg/mL of GST variant, 4 mM chloro- 
dinilrobenzene, 4 niM glutathione, and 100 niM potassium phosphate pH 
6.5. Product formation was monitored at 340 nm, yielding reaction rates 
(vinii) of 1 .0 //mol/inin for wild-type GST, 0.36 //mol/min for GST(-40), 
and 0.15 /nnol/min for GST(~40) after being boiled and cooled. 
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